NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Sparse Autoencoders for Hypothesis Generation

Movva, Rajiv; Peng, Kenny; Garg, Nikhil; Kleinberg, Jon; Pierson, Emma (June 2025, International Conference on Machine Learning)

We describe HypotheSAEs, a general method to hypothesize interpretable relationships between text data (e.g., headlines) and a target variable (e.g., clicks). HypotheSAEs has three steps: (1) train a sparse autoencoder on text embeddings to produce interpretable features describing the data distribution, (2) select features that predict the target variable, and (3) generate a natural language interpretation of each feature (e.g., mentions being surprised or shocked) using an LLM. Each interpretation serves as a hypothesis about what predicts the target variable. Compared to baselines, our method better identifies reference hypotheses on synthetic datasets (at least +0.06 in F1) and produces more predictive hypotheses on real datasets (~twice as many significant findings), despite requiring 1-2 orders of magnitude less compute than recent LLM-based methods. HypotheSAEs also produces novel discoveries on two well-studied tasks: explaining partisan differences in Congressional speeches and identifying drivers of engagement with online headlines.
more » « less
Free, publicly-accessible full text available June 18, 2026
Sparse Autoencoders for Hypothesis Generation

Movva, Rajiv; Peng, Kenny; Garg, Nikhil; Kleinberg, Jon; Pierson, Emma (May 2025, ICML)

Free, publicly-accessible full text available May 30, 2026
Generative Artificial Intelligence in Medicine

https://doi.org/10.1146/annurev-biodatasci-103123-095332

Shanmugam, Divya; Agrawal, Monica; Movva, Rajiv; Chen, Irene Y; Ghassemi, Marzyeh; Jacobs, Maia; Pierson, Emma (March 2025, Annual Review of Biomedical Data Science)

The increased capabilities of generative artificial intelligence (AI) have dramatically expanded its possible use cases in medicine. We provide a comprehensive overview of generative AI use cases for clinicians, patients, clinical trial organizers, researchers, and trainees. We then discuss the many challenges—including maintaining privacy and security, improving transparency and interpretability, upholding equity, and rigorously evaluating models—that must be overcome to realize this potential, as well as the open research directions they give rise to.
more » « less
Free, publicly-accessible full text available March 18, 2026
Using Large Language Models to Promote Health Equity

https://doi.org/10.1056/AIp2400889

Pierson, Emma; Shanmugam, Divya; Movva, Rajiv; Kleinberg, Jon; Agrawal, Monica; Dredze, Mark; Ferryman, Kadija; Gichoya, Judy Wawira; Jurafsky, Dan; Koh, Pang Wei; et al (January 2025, NEJM AI)

Free, publicly-accessible full text available January 23, 2026
Using unlabeled data to enhance fairness of medical AI

https://doi.org/10.1038/s41591-024-02892-0

Movva, Rajiv; Koh, Pang Wei; Pierson, Emma (April 2024, Nature Medicine)

Full Text Available
Coarse race data conceals disparities in clinical risk score performance

Movva, Rajiv; Shanmugam, Divya; Hou, Kaihua; Pathak, Priya; Guttag, John; Garg, Nikhil; Pierson, Emma (August 2023, MLHC)

Full Text Available

Search for: All records